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Abstract 



Given a polygonal curve P, a pointset S, and an s > 0, we study the problem of finding a 
polygonal curve Q whose vertices are from S and has a Frechet distance less or equal to e to 
curve P. In this problem, Q must visit every point in S and we are allowed to reuse points of 
pointset in building Q. First, we show that this problem in NP-Complete. Then, we present a 
polynomial time algorithm for a special cases of this problem, when P is a convex polygon. 
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1 Introduction 

Geometric pattern matching and recognition has many applications in geographic information 
systems, computer aided design, molecular biology, computer vision, traffic control, medical 
imaging etc. Usually these patterns consist of line segments and polygonal curves. Frechet 
metric is one of the most popular ways to measure the similarity of two curves. An intuitive 
way to illustrate the Frechet distance is as follows. Imagine a person walking his/her dog, 
where the person and the dog, each travels a pre-specified curve, from beginning to the end, 
without ever letting go off the leash or backtracking. The Frechet distance between the two 
curves is the minimal length of a leash which is necessary. The leash length determines how 
similar the two curves are to each other: a short leash means the curves are similar, and a 
long leash means that the curves are different from each other. 

Two problem instances naturally arise: decision and optimization. In the decision problem, 
one wants to decide whether two polygonal curves P and Q are within e Frechet distance to 
each other, i.e., if a leash of given length e suffices. In the optimization problem, one wants 
to determine the minimum such e. In pQ, Alt and Godau gave an 0(n 2 ) algorithm for the 
decision problem, where n is the total number of segments in the curves. They also solved 
the corresponding optimization problem in 0(n 2 log n) time. 

In this paper, we address the following variant of the Frechet distance problem. Consider 
a point set Sctf and a polygonal curve P in K d , for d ^ 2 being a fixed dimension. The 
objective is to decide whether there exists a polygonal curve Q within an e-Frechet distance 
to P such that the vertices of Q are all chosen from the pointset S. Curve Q has to visit 
every point of S and it can reuse points. We show that this problem is NP-Complete. We 
then present a polynomial time decision algorithm for a special case of the problem where 
the input curve P is a convex polygon. 
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This paper is organized as follows: in Section [2] we establish our NP-Complete proof for 
the general case of the problem. In Section [3] we investigate the special case of the problem. 
Finally, we conclude in Section [4] with some open problems. 



2 General Case is NP-Complete 
2.1 Preliminaries 

Given two curves a, (3 : [0, 1] — > M. d , the Frechet distance between a and (3 is defined as 
5p{a,P) = inf CT T max te [ 0j i] ||a(o"(t)), / 8(r(t))||, where a and r range over all strictly monotone 
increasing continuous functions. The following two observations are immediate. 

► Observation 1. Given four points a, b,c,de R d , if \\ab\\ ^ e and 1 1 cd| ^ e, then 5f{o&, bd) ^ 
e. 

► Observation 2. Let ai, a 2 , Pi, and /3 2 be four curves such that #f(oi,/3i) ^ e and 
<5_p(a2, /3 2 ) e. If the ending point of ct\ (resp., /3i), is the same as the starting point of a 2 
(resp., P 2 ): then Sp(ai + a 2 , Pi + P 2 ) ^ s, where + denotes the concatenation of two curves. 



Notations. We denote by P =< P\p 2 Pz---p n >j a polygonal curve P with vertices P1P2 ■ ■ -Pn 
in order and by start(P) and end(P), we denote the starting and ending point of P, 
respectively. For a curve P and a point x, by P © x, we mean connecting end(P) to point x 
(we use the same notation P © Q to show the concatenation of two curves P and Q). Let 
M(ab) denote the midpoint of the line segment ab. For a point q in the plane, let x(q) and 
y(q) denote the x and y coordinate of q 7 respectively. 

For two line segments ab and cd, with ab H cd, we denote the intersection point of them. 
Also, for a point a and a line segment 6c, a _L &c denotes the point on 6c located on the 
perpendicular from a to 6c. Also, dist(a, be) denotes the distance between a and segment 6c. 

► Definition 1. Given a pointset S in the plane, let Curves(S) be a set of polygonal curves 
Q =< qiq 2 ...q n > where: 

Vq,; : qi G S and 
Va e 5 : 3qi s.t. qi = a 

► Definition 2. Given a pointset S, a polygonal curve P and a distance e, a polygonal curve 
Q is called feasible if: Q € Curves(S) and 5f(P, Q) < £■ 

We show that the problem of deciding whether a feasible curve exists or not is NP- 
completc. It is easy to see that this problem is in NP, since one can polynomially check 
whether Q G Curves(S) and also Sp{P, Q) < e, using the algorithm in [lj. 



2.2 Reduction Algorithm 

We reduce in Algorithm [T] an instance of 3CNF-SAT formula <j> to an instance of our problem. 
The input is a boolean formula (f> with k clauses C\ , C'2, ■ ■ ■ , Ck and n variables x± , x 2 , ■ ■ ■ , x n 
and the output is a pointset S, a polygonal curve P in the plane and a distance e = 1. 

We construct the pointset S as follows. For each clause Cj, 1 < i < k, in the formula </>, 
we place three points {si, gi, c^}, refereed by cli points, in the plane, which are computed in 
the i-th iteration of Algorithm [l] (from line [3] to line 13 1. We define Oi to be M(sjgi). By 
SQj, 1 < i < k, we denote a square in the plane, centered at Oi, with diagonal slgl. We refer 
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to SQ;, 1 < i < k, as c-squares. For an example of a pointset S corresponding to a formula, 
see Figure [T] 

Our reduction algorithm constructs the polygonal curve P through n iterations. In the 
i-th iteration, 1 < i < n, it builds a subcurve U corresponding to a variable X{ in the formula 
<j) and appends that curve to P. In addition to those n subcurves, two curves l n +i and l n +2 
are appended to P. We will later discus the reason we add those two curves to P. Every 
subcurve li of P starts at point u and ends at point v. Furthermore, every li goes through 
c-squares SQi to SQ& in order, enters each SQj from the side cjs] and exists that square from 
the side cjgj (for an illustration, see Figure [T]) . Curve U itself is built incrementally through 
iterations of the loop at line 29 of Algorithm [l] In the j-th iteration, when li goes through 



SQj, three points, which are within SQj, are added to U (these three points are computed 



through lines 30 to 35). Next, before U reaches to SQj+i, two points, denoted by ctj and (3j, 



are added to that curve (these two points are computed in lines 37 and 38 1 . 

Since each ^ corresponds to variable Xi in our approach, this is how we simulate 1 or 
values of xf. Consider a point object Ol traversing li, from starting point u to ending point v. 
Consider another point object O2 which wants to walk from u to v on a path whose vertices 
are from points in S and it wants to stay in distance one to Ol- We will show that by our 
construction, object O2 has two options, cither taking the path A =< u, s±, g2, S3 ... w > or 
the path B =< u, g±, S2> <?3 • ■ ■ v > (See Figure [I] and [6] for an illustration). Choosing path A 
by O2 means Xi = 1 and choosing path B means Xj = 0. We first prove in Lemma [3] that 
Sp(li,A) < 1 and in Lemma|4]that 5p(l i7 B) < 1. Furthermore, by Lemma[5] we prove that 
that as soon as O2 chooses the path A at point u to walk towards v, it can not switch to any 
vertex on path B. In addition, in lemmas [6] and [7] we prove that if Xi appears in the clause 
Cj, O2 could visit point Cj via the path A and not B. In contrast, when ^Xi appears in the 
clause Cj, O2 could visit point Cj via the path B and not A. However, when both of Xi and 
-^Xi does not appear in Cj, O2 can not take A or B to visit Cj. 

► Lemma 3. Consider any subcurve li =< u . . .v >, 1 < i < n + 2, which is built through 
lines [^5| to [JO] of Algorithm^ Let A be the polygonal curve < usig2S3g4..v >. Then, 
S F (h,A) < 1. 

Proof. We prove the lemma by induction on the number of segments along A. Consider two 
point objects Ol and Oa traversing l t and A, respectively (Figure [I] depicts an instance of h 
and A). We show that Ol and Oa can walk their respective curve, from the beginning to 
end, while keeping distance 1 to each other. 

The base case of induction trivially holds as follows (see Figure [7] for an illustration) : 
Table I] lists pairwise location of Ol and 0,4, where the distance of each pair is at most 1. 
Hence, 0^ can walk from u to s\ on the first segment of A (segment us\), while keeping 
distance < 1 to 0^. 

Assume inductively that Ol and Oa have feasibly walked along their respective curves, 
until 0a reached Sj. Then, as the induction step, we show that 0a can walk to gj+i and 
then to Sj +2 , while keeping distance 1 to 0^. Table [T] lists pairwise location of 0^ and 0^ 
such that 0a could reach Sj + 2- One can easily check that the distance between pair of points 
in that table is at most one. (For an illustration, see Figure |8|. 

Finally, if k is an odd number, then s^h is the last segment along B, otherwise, g~k$ is 
the last one. In any case, that edge crosses the circle 23 (77, 1), where 77 is the last vertex of 1$ 
before v (point 77 is computed in line 14 of Algorithm [lj . Therefore, 0a can walk to v, while 



keeping distance 1 to 0l- 

•4 -4 
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Algorithm 1 Reduction Algorithm 

Input: 3SAT formula <f> with k clauses C\ . . .Ck and n variables x\ . . . x. 
Construct pointset S: 



1: Si- 

2: 91 = (1,1) 

3: for j = 1 to fe do 

4: Si <- (x{ gj ) -2,y(gj) -2) 

5: o 3 <- M(sjgj) 

6: if (j is odd) then 

7: Cj <- (x{s j ),y{g j )), w 3 <- (x(oj) + \,y{oj) - \) 

8: g j+1 <- (z(sj) + | + 8, y( Sj ) + | + 15) 

9: else 

10: Cj <r- (x(g j ),y{s j )), Wj <r- (x(Oj) - \,y{c-j) + \) 

11: , 9j + 1 «- (*(«,•) + | + 15, J/( Sj ) + | + 8) 

12: Zj = M(cjwj) 

13: 5 = S U {Sj, gj, Cj} 

14: if (fc is odd) then 

15: r] <- (x{o k ) + l,y(o k ) +4) 

16: v <- (ar(o fc ) + l,y(ofe) + 9) 

17: else 

18: 77 <- (x(o fc ) + 4,y(o fe ) + 1) 

19: V «- (x(Ofc)+9,I/(Ofc) + l) 

20: t «- (ar(u),2/(ti) - 20) 
21: li = (-9,-1) 
22: 5 = SU {«, M} 



Construct polygonal curve P: 

23: P <- 

24: P P © i 

25: for z = 1 to n + 2 do 



26: Zj «- 

27: Zj «- Zj 8 U 

28: Zj Zj (-4, -1) 

29: for j = 1 to k do 

30: if (a;j e Cj and j is odd ) or (->Xi G Cj and j is even ) then 

31: h <- u © M(sjcj) © c-,- e Wj 

32: else if (-iXj G Cj and j is odd ) or (a;j G Cj and j is even ) then 

33: Zj <- h © Wj © Cj 8 M(gjcj) 

34: else 

35: Zj -S— ZjWj © Cj u>j 

36: if j ^ k then 

37: Qj = | 5j + ififj+i 

38: j3j = gSj + gSj+i 

39: h <- k® aj © ySj 

40: Zj «- Zj © T) © U 

41: P^P®k 

42: P^P®t 



43: return pointset S 1 , polygonal curve P and distance e = 1 
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(-4,-1) 

Figure 1 Assume that formula <f> has four clauses Ci,C%,C3 and C4, where the occurrence of 
variable Xi in those clauses is: ->Xi G C\, -^Xi G C'2, Xi G C3 and Xi G C4. For each clause Cj, the 
reduction algorithm places three point Sj , <?i and c, in the plane. Blue curve is an example of curve 
li which corresponds to variable Xi in <j>. Red curve is curve A. 



► Lemma 4. Consider any subcurve U —< u, v >, 1 < i < n + 2, constructed through lines 
25 to 40 of Algorithm^ Let B be the polygonal curve < ugxS2g^s^..v >. Then, 8p{li,B) < I. 



Proof. The proof is analogous to the proof of Lemma [3] see appendix. 

► Lemma 5. Consider any curve i,- C P, 1 < i < n + 2, and a point object 0^ walking from 
u to v on li. Also, imagine two point objects 0^ and Ob, walking on curves A and B (from 
Lemmas^and^, respectively while keeping distance 1 to 0^. Then, if 0^ switches to path 
B or Ob switches to path A, they loose distance 1 to 0^. 

Proof. See Appendix. < 



For 1 < i < k, if i is an odd number, set Oj = Si and bi = g t and if i is an even number, 
set cij = c/i and bi = s.j . 

► Lemma 6. Consider the curve A —< 110,10,203 . . . a^v > from Lemma [3} Let A\ be a 

subcurve of A which starts at u and ends at a,j, 1 < j < k. Furthermore, let A2 be a 
subcurve of A which starts at aj and ends at v. For any curve U , 1 < i < n + 2, if Xi € Cj, 
5f{Ai Cj ® A2, k) < e. Similarly, consider the curve B —< ub^b^ . . . b^v > from Lemma 
Let B\ be a subcurve of B which starts at u and ends at bj, 1 < j < k. Furthermore, let 
B2 be a subcurve of B which starts at bj and ends at v. For any curve li , 1 < i < n + 2, if 
-^x l £ Cj, 6 F (Bi 9 Cj 8 B 2 , h) < e. 
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Proof. When Xi appears in clause Cj, point z — M(cjOj) is a vertex of U. Since ||cjOj|| = 2 
and z is the midpoint of cjaj, Ol can wait at z while 0,4 visits Cj When appears in 
clause Cj, point z = M(cjbj) is a vertex of 1^. Since ||cj6j| — 2 and z is the midpoint of Cjbj, 
Ol can wait at z while Oa visits Cj and comes back to bj -4 

► Lemma 7. Consider curve A (respectively, B). For any curve h, 1 < i < n + 2, if Xi Cj 

and ~^Xi (£ Cj, then A (resp., B) can not be modified to visit Cj. 

Proof. This is because dist(vjj, ajCj) > 1 and dist(wj,bjCj) > 1. -4 



► Theorem 8. Given a formula (f> with k clauses C%, C<x, . . . , C^ and n variables x\, x% . . . , x n , 

as input let curve P and pointset S be the output of Algorithm^ Then, <p is satisfiable iff a 
curve Q G Curves(S) exists such that 8p(P,Q) < 1. 

Proof. For (=^): Assume that formula 4> is satisfied. In Algorithm [2] we show that knowing 
the truth value of the literals in (f>, we can build a curve Q which visits every point in S and 
8 F (P,Q) < 1. 

Algorithm 2 Build a feasible curve Q 

Input: Truth table of variables x\, x%, . . . , x n in <f> 
Q 4- 
Q Q®t 
for i = 1 to n do 
if (xi = 1) then 

7r <— < uaia2a,3 . . . atv > 
for all Cj clauses, if xi £ Cj do 

let 7Ti be subcurve of it from u to aj 
let 7T2 be subcurve of it from aj to v 

IT <— 1T\ Cj 7T 2 
Q 4- Q®TT 

else 

7r ^ — < ubib 2 b 3 . . . bkV > 
for all Cj clauses, if -<Xi € Cj do 
let tt\ be subcurve of tt from u to bj 
let 7T2 be subcurve of tt from 6j to v 

TT <— 7Ti Cj 7T 2 
Q <- Q07T 

Q <- Q©t 
Q 4- Q@ < uaia 2 a^ . . . akv > 
Q 4- Q@t 

Q <- Q® < ubib 2 b?,...b k v > 
Q <r- Q@t 
return Q 



First we show 5f(P,Q) < 1, where Q is the output curve of Algorithm [2j Recall that 
by Algorithm [l] curve P includes n subcurves U each corresponds to a variable Xi. Both 
curves P and Q start and end at a same point t. For each curve tt which is appended to Q 
in the z-th iteration of Algorithm [2] (line 10 or line 17), Sf(tt, h) < 1 by Lemma[6] Notice 
that P also includes two additional subcurves l n +i and l n +i whereas there is no variable 
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x n+ \ and x n+ 2 in formula (f>. These two curves are to resolve two special cases: when all 
variables Xi are 1, no ~^Xi appears in <f>, and when all variables Xi are 0, no Xi appears in cf>. 
Because of these two curves, we add two additional curves in line [19] and [21] to Q. Finally, 
by Observation^ 8 F (P,Q) < 1. 

Next, we show that curve Q visits every point in S. First of all, by the curves added to Q 
in line 19 and|21[ all a,j and bj, 1 < j < k, in S will be visited. It is sufficient to show that 
Q will visit all Cj points in S as well. Since formula <f> is satisfied, every clause Cj in <j> must 
be satisfied too. Fix clause Cj. At least one of the literals in Cj must have a truth value 1. 
If Xi £ Cj and Xi — 1, then by line [9] curve Q visits Cj. On the other hand, if ^x^ 6 Cj and 
Xi = 0, by line 16 curve Q visits Cj. We conclude that curve Q is feasible. 

Now we prove the (-<=) part: 

Let Q be a feasible curve with respect to P and pointset S. Notice that curve P consists 
of n subcurve li, 1 < i < n, each corresponds to one variable x,-. From the configuration of 
each li in c-squares, one can easily construct formula <j> with all of its clauses and literals. 

Imagine two point objects Oq and Op walk on P and Q respectively. We find the truth 
value of variable Xj in the formula by looking at the path that Oq takes to stay in 1-Frechet 
distance to Op, when Op walks on curve Zj corresponding to Xj. If Oq takes path A from 
Lemma [3] while Op is walking on l i7 then Xj = 1; whereas If Oq takes path B from Lemma 
[4]while Op is walking on li, then Xj — 0. Object Oq decides between path A or B, , when 
both Oq and Op are at point u. Lemma [5] ensures that once they start walking, Oq can not 
change its path from A to B or from B to A. Therefore, the truth value of a variable Xj is 
consistent . 

The only thing left to show is the reason that formula <j> is satisfiable. It is sufficient to 
show every clause of 4> is satisfiable. Consider any clause Cj. Since curve Q is feasible, it 
uses every point in S. Assume w.l.o.g. that Oq visits Cj when Op is walking along curve Zj. 
By Lemmas [5] and [6| this only happens when either (x^ appears in Cj and Xj = 1) or (^Xj 
appears in Ci and Xj = 0). Therefore, Cj is satisfiable. 

The last ingredient of the NP-completeness proof is to show that the reduction takes 
polynomial time. One can easily see that Algorithm [l] has running time 0(nk), where n is 
the number of variables in the input formula with k clauses. 

•4 



To show the correctness of above lemmas, we have implemented our reduction algorithm. 
We test our algorithm on a formula <fi with four clauses. This enables us to check all 
possible configurations of U in Algorithm [l] The program generates three sets, a pointset 
S = { s i> ffii c ii s 2, 52, C2, S3, 33, C3, 84, #4, C4}, a curve set L and a curve set C as follows. 

Imagine a polygonal curve which starts from point u, goes through points in S and ends 
at v. Our program generates all possible such curves and keep them in set C. Therefore, C 
contains almost 1,000,000,000 curves. 

Another set L includes all different configuration of curve U which corresponds to variable 
Xi in the formula. Since Xj or ^Xi or none could appear in one clause and the formula has 
four clauses, the set L contains 81 different curves. 

In our application, we compute Frechet distance between every curve in C and every 
curve in L. The results show that all the curves in C have Frechet distance greater than 1 
to curves in L except two curves < u, s±, 32, S3, 34, v > and < u,gi, S2, 93, S4, v >. In other 
words, for only 162 pairs of curves, we got: 

V^t e L : 5 F {[i, < u, si, 32, s 3 , 34, v >) < 1 and V/x € L : S F (fj,, < u, 31, s 2 , 33, s 4 , v >) < 1 

In addition to above tests, we verified the correctness of Lemma [6] in different cases. 
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3 Convex Polygon Case 

In this section, we address the following problem: given a convex polygon CP and a pointsct 
S in the plane, find a polygonal curve Q whose vertices are from S, and the Frechet distance 
between Q and a boundary curve of CP is minimum. Note that each point of S must be used 
in Q and it can be used more than once. In the decision version of the problem, we want to 
decide if there is a polygonal curve Q through all points in S, whose Frechet distance to a 
polygon's boundary curve is at most e, for a given e ^ 0. 

3.1 Preliminaries 

We borrow some notations from [5] as we make use of the algorithm in that paper to solve 
the decision version of our problem. For any point p in the plane, we define 23(p, e) = {q G 
K 2 : \\pq\\ ^ £} to be a ball of radius e centered at p, where || • || denotes Euclidean distance. 
Given a line segment L C M 2 , we define Q(L,e) = U p ez,CB(p, e) to be a cylinder of radius e 
around L. 

In this section, whenever we say polygon, we mean a convex polygon. Also, when we say 
a curve visits a point u, we mean that u is a vertex of the curve. We denote by CH(S) the 
convex hull of pointset S. 

For an interval I of points in P, we denote by left (7) and right (7) the first and the last 
point of I along P, respectively. Given two points u and v in Si, we say u is before v, and 
denote it by u ^< v when left (Pi [u]) is located before left ( _Pj [v] ) on P. Moreover, we say u is 
entirely before v (or v is entirely after u) and denote it by u -< v, when right (Pi [it]) is located 
before left(F l [w]) on P. 

The Decision Algorithm in (2] Given a polygonal curve P of size n (with starting point 
s and ending point t), a pointset S of size k and a distance e > 0, the algorithm in that 
paper decides in 0(nk 2 ) time, whether there exists a polygonal curve through some points 
in S in e- Frechet distance to P. 

Curve P is composed of n line segments P\. . .P n . For each segment P j; , Cj denotes the 
cylinder C(Pj,e), and Si denotes the set S n CV Furthermore, for each point d e C„ Pi[v] 
denotes the line segment Pi n 23 (v, e) [2]. A polygonal curve R is called semi- feasible if all its 
vertices are from S and Sp(R, P') ^ e for a subcurve P' C P starting at s. A point v £ Si is 
called reachable, at cylinder Cj, if there is a semi- feasible curve ending at v in CV 

Following is a brief outline of how the decision algorithm in [5] works: it processes the 
cylinders one by one from C\ to C n , and identifies at each cylinder Ck all points of S which 
are reachable at Ck- The reachable points for each cylinder C'k, k from 1 to n, is maintained 
in a set called reachability set, denoted by 31). ■ In the fc-th iteration of the algorithm, first all 
points in Sk which are reachable through a point in a set %i, for 1 ^ i < k, are added to 3lk ■ 
These points are called the entry points of cylinder Ck- We denote, by the leftmost entry 
point of cylinder Ck, which at this step, equal to q = min,,^,. left(Pfc[u]). Next, all points in 
Sk which are reachable through are added to Jlk- Finally, the decision algorithm returns 
YES , if !R n n 33(£,e) ^ 0. 

3.2 Decision Algorithm 

Let J" be a convex polygon of size n, S be a pointset of size k and e > be an input distance. 
For T, we call a curve, a boundary curve of CP, denote it by <r(CP, u), if it starts from point u 
on the boundary of CP, goes around the polygon on the boundary once and ends at u. 
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Figure 2 A problem instance 

► Definition 9. Given a pointset S, a convex polygon 7 and a distance e, a polygonal 
curve Q, is called feasible if Q € Curves(S) and a boundary curve z) exists such that 
5 F (a(7,z),Q)<e. 

As the first step of our algorithm, we execute the decision algorithm in fj] . Note that 
here as opposed to in fj] , the starting and ending point of the curve is unclear because the 
input is a convex polygon. Which point on the boundary we choose? The following lemma 
justifies our choice later: 

► Lemma 10. Given a convex polygon CP, a pointset S and a distance e, a necessary condition 
to have a feasible curve through S is: 5p{o'{QH{S),z),a('S',z 1 )) < e, where z is a vertex of 
GH(S) and z' is a point on the boundary of? s.t. \\zz'\\ < e. 

Proof. See Appendix. A 

Let x be the point with smallest cc-coordinate in 5* and x' be a point on the boundary of T 
s.t. \\xx'\\ < e. Furthermore, let p = tr(CP, x') (curve p has n+1 line segments p\,pi, • ■ ■ , pn+i) 
Run the decision algorithm in [2 , with parameters p, S and s. Result is the reachability sets 
3?i, 3?2, ■ • • , 3?n+i where each Hi maintains the reachable points at cylinder Cj. 

► Definition 11. Consider two consecutive reachability sets "JLi and 3tj, 1 < i < j < n. Let 
u be a point in Jii. Then, we call u a type A point at Cj if there exists a semi- feasible curve 
which contains u as its vertex and ends at Xj. Otherwise, we call u a type B point at Ci and 
we call Xj the rival of u. 

► Observation 3. Let u and v be type A and type B points at cylinder Ci, respectively. Then, 
u < v. 

► Observation 4. A type B point u at cylinder Ci is: 

•a Bl point at Cf. when u doesn't reach to but it reaches to some other point v in 71 j 
(see Figure |3]i). 

a B2 point at Ci, when u does not reach to any point in 31 j, but it reaches to a point in 
R-k, i < j < k (see Figure^). 

a _B3 point at Ci, when u does not reach to any point in 3^, i < j < k (see Figure^). 

After running the algorithm in [2], we process the reachability sets 3?i, 3^2, • ■ ■ one 
by one in order, and we identify the types of the points for every point in each set. Notice 
that a point u may be located in multiple cylinders, so it might be reachable at more that 
one cylinder and thus be in more than one reachability set. 

Let 7r and n be the upper and the lower chain of T, respectively. Let Tube(T) be the 
union of all C(Pj, e), where Pi is an edge of T, 1 < i < n. Tube(7r) and Tube(p) are defined, 
analogously. 
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(a) 





(b) 



(c) 



Figure 3 (a) Point u is a Bl point at Ci (b) Point u is a B2 point at Ci (c) Point m is a B3 
point at Ci. 




Figure 4 Proof of Lemma 14 



► Definition 12. We call a point a Twice- TypeB point, if it is of type B at two cylinders. 

► Definition 13. We call a connected area within Tube(J'), shared between two cylinders, 
one corresponding to an edge in n and another corresponding to an edge in fi, a Double- 
TypeB area, if it contains a Twice- TypeB point. 



► Lemma 14. Given a polygon CP, a pointset S and e > 0, at most two Double-TypeB 
exists. 



areas 



Proof. Assume w.l.o.g. that CP is stretched horizontally (see Figure [4]) (In the case that CP is 
vertically stretched, decompose it into right and left chains and the rest of argument is the 
same as here). 

Assume for the sake of contradiction, that there are three such areas and a point w is 
a Twice- TypeB point located in the middle one. (see Figure [4]). Let w' and w" be the 
rivals of w in the upper and lower chain of CP, respectively. Assume w.l.o.g. that w and w' 
are located in two consecutive cylinders (within Tube(7r)) which share 25(pi,e). Similarly, 
assume w.l.o.g. that w and w" are located in two consecutive cylinders (within Tube(/i)) 
which share 23 (j>2, e). Since w is a type B point and w' is the rival of w, edge ww' does not 
cross circle B(pi,e). Because of the same reason, edge wvr does not cross circle 23(p 2 ,e)- 
This implies that, vertices of T to the right of pi has a y-coordinate less than p x and vertices 
before P2 has a y-coordinate greater than p 2 > meaning that p 2 has the lowest y coordinate 
among vertices of CP. Therefore, no Double-TypeB area exists to the right of the area in 
which w is located, a contradiction. 



► Lemma 15. There exists a feasible curve iff Algorithm^ returns YES. 

Proof. First (<=): It is easy to observe that conditions in lines [5] [6] [8] are necessary to 
have a feasible curve through S. It suffices to show for the curve a built by Algorithm [3j 
5f{ol, p) < e. 

We show this by induction on the number of edges in p. To handle the base case of 
the induction, assume that p has an additional edge consist of only point x' . Imagine a 
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point object Oy walking on the boundary of CP, starting from point x' and imagine another 
point object Q walking on curve a starting from x while keeping distance < 1 to Oy. Since 
distance Oy and a is less than e at the start, the base case of the induction holds. 

We show fly can walk the whole p and ends at x'. Assume inductively that in the loop 
in line we have processed 3?i to CR n _i, and now we are about to process CR„. So every 
point in Si—i is in a and 0j> can walk to a point in pi-\ in e distance to Q . Let v be the 
leftmost point at Cj such that end(a) u. If v — \i, by Observation [2j we can add every 
point in Si to a and Oy can proceed to pi. Otherwise; since A^ is a reachable point, a point 
u must exist in CR^, k < i such that u reach A^. Assume that u is after all points in direction 



p which can reach to A^. It suffices to show point u is a vertex of a, so that in line 14 by 
reaching to u, the algorithm stops removing vertices in a and connects it to A^. Two cases 
happen here: (i) u 6 Si-i, and some points in Si-\ are type A and some are type B points. 
Then, type B points are removed from a (because they can not reach A^ by definition), and 
u will be connected to \i (see Figure j5|. Therefore, Oy can walk to the next edge on p. (ii) 
When u£%ki k < i — 1, then observe that because of the convexity of the polygon, u reach 
points in 3ij which can not reach A^, k < j < i (see Figure pj. Therefore, u is a vertex in a. 

Now we show that curve a can be modified such that it visits every Twice-TypeB point 
when Oy walks on 7r (with the same argument, curve a can be modified to visit all such 
points when 0j> walks on p). Let's call Double- TypeB areas, area I and area II. Let b be the 
first type B point in direction 7r at area I. Assume w.l.o.g. that b is in Si. Let u be a point 
after all points in direction 7r which can reach Aj+j.. Because of the convexity of polygon, 
point u reaches b. It is clear that as soon as a reaches b, it can visit all type B points at Ci 
in sorted -< order. Let d be the leftmost point in Si+i which the last type B point at Ci can 
reach. If d is a vertex of a, then a now has all Twice-TypeB points at area I. When d is not 
a vertex of a, because of the convexity of the polygon, still d can be connected to a vertex of 
a which is reachable from d. Therefore, one can modify curve a to visit Twice-TypeB points. 
(=>): Let Q be a feasible curve through S. Assume that there is no Twice-TypeB point 



among points in S. Let z be any point in S. Then, by Lemma 14 if z lies in both Tube(ii) 
and Tube(p), then it can be type B only with respect to one of tt or p. Assume w.l.o.g, that 
z is a type B point in the upper chain. Therefore, z is a type A point in the lower chain and 
it will be visited by a. Now, if there are some Twice-TypeB points in S, it is easy to show 
that z will be visited by one of /3 or 7 curves in our algorithm. 

•4 

► Theorem 16. Given a convex polygon CP with n edges and a set S of k points, we can decide 
in 0{nk 2 ) time whether there is feasible curve Q through S for a given e 0. Furthermore, 
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Algorithm 3 Decision Algorithm 

Input: A convex polygonal CP, pointset S and distance e 
1: let x be the point with smallest x-coordinate in S 
2: let x' be a point on the boundary of CP s.t. ||xx'|| < e 
3: let p — cr(CP, x') (curve p has n+l line segments) 

4: run the decision algorithm in [2J, with parameters p, S and e 
5: return NO if a point in is not in any 3?^, l<i<n+l 
6: return NO if %n+\ H e) = 
7: process reachability sets from 3?! to CR n in order, 

to identify types of points in each CR^ 
8: return NO if a point exists of type B3 

in every cylinder at which that point is reachable 

9: a «- 

10: a <- a © x 

11: for i = 1 to n do 

12: let v be a leftmost point at C, s.t. end(a) v 
13: if «^ Aj then 

14: remove end(a) until a vertex u of a is found s.t. u 

15: a -f- a© (every point in Si in sorted -< order) 

16: if a Twice-TypeB point exists then 

17: f3 <— modify a s.t. it visits those points in 7r 

18: 7 <— modify a s.t. it visits those points in p 

19: return YES if a or /3 or 7 include all points in S 



a feasible curve Q which minimizing the Frechet distance to the boundary curve of CP, can be 
computed in 0(nk 2 log(nfe)) time. 

4 Conclusions 

In this paper, we investigate the problem of deciding whether a polygonal curve through 
a given point set S exists which is in e-Frechet distance to a given curve P. We showed 
that this problem is NP-Complete. Also, we presented a polynomial time algorithm for the 
special case of the problem, when a given curve is a convex polygon. 

Several open problems arise from our work. From our first result, one could investigate 
some heuristic methods or approximation algorithms. From the second part, in particular, it 
is interesting to study the problem in the case where the input is a monotone polygon, a 
simple polygon or a special type of curve. 
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Table 1 Distance between pair of points is less or equal to one 
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(-4,-1) 

Figure 6 Red curve is curve B. Blue curve is an example of curve U which corresponds to variable 
Xi in formula <j> with four clauses C\, C2, C3 and C4. The occurrence of variable x% in the clauses is: 
->Xi G C\, -<Xi G C2, aij G C3 and Zi G C4. 



Proof of Lemma |4j 

Proof. Consider two point objects 0^ and 0^ traversing Zj and B, respectively (Figure [6] 
depicts an instance of U and B). To prove the lemma, we show that Oj, and 0^ can walk 
their respective curves, from beginning to the end, while keeping distance 1 to each other . 

The base case of induction holds as follows (see Figure [9] for an illustration) : Table [2] 
lists pairwise location of 0l and 0#, where the distance of each pair is less or equal to 1. 
Therefore, Ob can walk from u to gi while keep distance one to 0j> 

Assume inductively that 0^ and 0^ have feasibly walked along their respective curves, 
until 0b reached gj. Then, as the induction step, we show that 0^ can walk to Sj+i and 
then to <7j + 2 , while keeping distance 1 to 0^. This is shown in Table [3] (see Figure 10 for an 
illustration) . 

Finally, if k is an odd number, then g^v is the last segment along B, otherwise, s^i is the 
last one. In any case, that edge crosses circle 23(77, 1): where rj is the last vertex of Zj before v 
(point rj is computed after the condition checking in line 14 of Algorithm [lj . Therefore, 0b 
can walk to v, while keeping distance 1 to 0^. 

•4 •* 



Proof of Lemma [5j 
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(a) 




9i = (1,1) 



u = (-9,-l) 



(-4,-1): 



si = (-1,-1) 



(b) 




(c) 



Figure 7 Base case of induction in the proof of Lemma [3] 

Proof. Notice that we have placed cli+i points far enough from cli points so that no curve 
can go to cli+i and come back to cli and stay in 1-Frechet distance to Therefore, to prove 
the lemma, we only focus on two consecutive c-squares. We show that no subcurve V C 



exists such that (for an illustration, see Figure 11) : 
h 5p(l', Sjg)) < 1 because: 

for all j, 1 < j < k, point Cj is always a vertex of U. A point on li in distance 1 to Sj 
lies before Cj in direction ij , while a point on li in distance 1 to point gj lies after Cj in 
direction li . Since dist(cj, sjgj) > 1, no subcurve I' C ^ exists such that 5f(V ,sjg~j) < 1. 

8f(1', < SjCjgj >) < 1 or 5_p(Z', < gjCjSj >) < 1, because: 

For all j, 1 < j < /c, is a vertex of A point on li in distance 1 to Sj lies 
before tUj in direction li , while a point on i, in distance 1 to point gj lies after Wj 
in direction ij . Since dist(iVj, sjcj) > 1 and dist(wj ,g~jgj) > 1, no subcurve Z' C 
exists such that Sp(l', < SjCjgj >) < 1. Similarly, no subcurve I' C ^ exists such that 
S F {l',<gjC jSj >) < 1. 



<5f(^', < s 3 Sj + i >) < 1 or Sf(1' , < 9j9j+i >) < 1 because: 



Vertex aj of U guarantees the first part as distipti, SjSj+i) > 1, and vertex fa of k 
guarantees the second part, as dist(/3i, g^gj+i) > 1. 
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Figure 8 Proof of Lemma 3 
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u = (-9,-l) \ ( - 4 '- 1) y 
Figure 9 Base case of induction in the proof of Lemma jZ] 



*i = (-1,-1) 



h Sf(1',< CjCj+i >) < 1, because dist{oti, afcj+i) > 1 

Sf(1',< uci >) < 1, because dist((— 4, — l),ncT) > 1 
h 5f(1',< Cj9j+i >) < 1, because dist(ai, cjgj+i) > 1 
h 5p(l',< CjSj+i >) < 1 5 because dist(cti, CjSj+i) > 1 

Sf(1', < Ckv >) < 1, because dist(rj, Ckv) > 1 

Proof of Lemma 1 1 01 

Proof. We prove the lemma by induction on the number of edges in CH(S). Let eie2..e n 
be the edges of 7, numbered after an arbitrary vertex of 7 in clockwise order. Obviously, 
to have a feasible curve, every point of S must be located within some cylinder C(ei,e). To 
establish the lemma, we show that when a feasible curve exists through S, the condition 
holds. 

Imagine a point object 0c which cycles QH(S), starting from a vertex of GH(S), say 
point u, and ending at the same point. Let v! be a point on the boundary of 7 such that 
||uu'|| < e. Imagine another point object Op which starts from it', walks on the boundary of 
7 until it reaches the same point v! . Both of the objects walk clockwise. 

To handle the base case of the induction, assume that the convex hull has an additional 
edge consist of only point u. Since the distance between 0c and Op is less than e at the 
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9j+i 




Figure 10 Proof of Lemma 4 
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Table 2 Pairwise location of 0_g and Ol, to prove the base case of induction in Lemma 4 



start, the base case of the induction holds. Assume inductively that c an d Op walk along 
their path, keeping distance e to each other, until Oc reaches to vertex v in GH(S) and Op 
reaches to point v' E ej on the boundary of V where \\vv'\\ < e. 

Let w be the next vertex on CH(S) after v. If w is located within the same cylinder 
C(ei,e) as v, then by Observation [2] the lemma holds. Otherwise, assume that w is in 
G(ej, e), i < j and w' is a point in tj where ||wu/|| < e. Consider the £-ball around each of 
the vertices of the polygon between points v' and w' . It suffices to show that edge e = mi> 
crosses each of those balls. 

Assume, for the sake of contradiction, that edge e does not cross one of those balls, say 
the one around vertex p. Assume w.l.o.g. that all points in S lie to the right of e. Then, two 
cases occur as illustrated in Figure 12 case (a), where CB(p, e) lies to the right of e, which 
contradict the fact that the polygon is convex; or case (b), where 23(p, e) lies to the left of e, 
which contradicts our assumption that a feasible curve exists, because no point in S is in 
e-distance to vertex p. < < 
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Table 3 Distance between pair of points is less or equal to one 
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Table 4 Proof of Lemma 3 the base case of induction 



